NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Hyperparameter tuning via trajectory predictions: stochastic prox-linear methods in matrix sensing

https://doi.org/10.1007/s10107-025-02279-0

Lou, Mengqi; Verchand, Kabir_Aladin; Pananjady, Ashwin (September 2025, Mathematical Programming)

Abstract Motivated by the desire to understand stochastic algorithms for nonconvex optimization that are robust to their hyperparameter choices, we analyze a mini-batched prox-linear iterative algorithm for the canonical problem of recovering an unknown rank-1 matrix from rank-1 Gaussian measurements corrupted by noise. We derive a deterministic recursion that predicts the error of this method and show, using a non-asymptotic framework, that this prediction is accurate for any batch-size and a large range of step-sizes. In particular, our analysis reveals that this method, though stochastic, converges linearly from a local initialization with a fixed step-size to a statistical error floor. Our analysis also exposes how the batch-size, step-size, and noise level affect the (linear) convergence rate and the eventual statistical estimation error, and we demonstrate how to use our deterministic predictions to perform hyperparameter tuning (e.g. step-size and batch-size selection) without ever running the method. On a technical level, our analysis is enabled in part by showing that the fluctuations of the empirical iterates around our deterministic predictions scale with the error of the previous iterate.
more » « less
Accurate, provable and fast nonlinear tomographic reconstruction: A variational inequality approach

Lou, Mengqi; Verchand, Kabir; Fridovich-Keil, Sara; Pananjady, Ashwin (May 2025, The 18th International Meeting on Fully 3D Image Reconstruction in Radiology and Nuclear Medicine)

Free, publicly-accessible full text available May 30, 2026
Computationally efficient reductions between some statistical models

Lou, Mengqi; Bresler, Guy; Pananjady, Ashwin (February 2025, Proceedings of Machine Learning Research)

Free, publicly-accessible full text available February 27, 2026
Efficient reductions between some statistical models

Lou, Mengqi; Bresler, Guy; Pananjady, Ashwin (February 2025, The 36th International Conference on Algorithmic Learning Theory)

Free, publicly-accessible full text available February 24, 2026
Alternating minimization for generalized rank-1 matrix sensing: sharp predictions from a random initialization

https://doi.org/10.1093/imaiai/iaae025

Chandrasekher, Kabir_Aladin; Lou, Mengqi; Pananjady, Ashwin (September 2024, Information and Inference: A Journal of the IMA)

Abstract We consider the problem of estimating the factors of a rank-$$1$$ matrix with i.i.d. Gaussian, rank-$$1$$ measurements that are nonlinearly transformed and corrupted by noise. Considering two prototypical choices for the nonlinearity, we study the convergence properties of a natural alternating update rule for this non-convex optimization problem starting from a random initialization. We show sharp convergence guarantees for a sample-split version of the algorithm by deriving a deterministic one-step recursion that is accurate even in high-dimensional problems. Notably, while the infinite-sample population update is uninformative and suggests exact recovery in a single step, the algorithm—and our deterministic one-step prediction—converges geometrically fast from a random initialization. Our sharp, non-asymptotic analysis also exposes several other fine-grained properties of this problem, including how the nonlinearity and noise level affect convergence behaviour. On a technical level, our results are enabled by showing that the empirical error recursion can be predicted by our deterministic one-step updates within fluctuations of the order $$n^{-1/2}$$ when each iteration is run with $$n$$ observations. Our technique leverages leave-one-out tools originating in the literature on high-dimensional $$M$$-estimation and provides an avenue for sharply analyzing complex iterative algorithms from a random initialization in other high-dimensional optimization problems with random data.
more » « less
Computationally Efficient Reductions Between Some Statistical Models

https://doi.org/10.1109/TIT.2025.3587354

Lou, Mengqi; Bresler, Guy; Pananjady, Ashwin (September 2025, IEEE Transactions on Information Theory)

Free, publicly-accessible full text available September 1, 2026
Do Algorithms and Barriers for Sparse Principal Component Analysis Extend to Other Structured Settings?

https://doi.org/10.1109/TSP.2024.3421618

Wang, Guanyi; Lou, Mengqi; Pananjady, Ashwin (January 2024, IEEE Transactions on Signal Processing)

Full Text Available

Search for: All records